But I’m not qualified to be a data scientist!

How to transition from university to the data science industry

Erika Braithwaite

2022-03-09

Welcome!

So you’re thinking about leaving the comforting womb of university, and transitioning to a data science career in industry? You may be feeling….

Why

Objectives

I’ll do my best to address these objectives in the most data-driven approach I can!

A bit about me

I’m the CEO of a health data science startup in Montreal www.precision-analytics.ca

My background

Our company

Our story

What is a data scientist

A combination of programming, statistics and domain knowledge.

I like to think of it as story telling using data

A very scary (and real) job posting

Myths about data science

Letting the data speak!

To tackle these myths, Kaggle conducts an annual survey of data scientists. Over 25,000 respondents completed the survey in 2021

‼️ Kaggle is a platform that hosts machine learning competitions. It’s users are not representative of the entire data science community ‼️

Data available for 2021

Demographics of respondents

Types of data scientist positions

Myth 1. You need a PhD to become a data scientist

Experience of respondents

Myth 2. Primary tools used at work or school

Myth 3. Data science == Machine learning and AI

Myth 4. Data science challenges are mostly analytical

The survey did not explicitly ask the way respondents broke up their time. So I did the next best thing… ask twitter

Data scientists challenges

In 2017, the Kaggle survey asked respondents about the “biggest challenges in data science”.

From here we can see that most people identify “dirty data” the toughest part of the job. The rest of the issues seem be organizational.

On transitioning

On transitioning

Understand your new “audience”

💰 The currency in academia versus the currency in the private sector 💰

Gaining more experience: Courses

The American Statistical Association held a two-day Data Science summit. 72 educators, researchers and practitioners in statistics, mathematics, computer science, and data science from academia, industry, government and nonprofit gathered to put forth recommendations for future data science programs.

Data science courses should expose students to:

Introduction to statistics
Data analysis in the real world
Math and algorithms
Answering real problems
Expose students to modern tools
Teach data ethics
Active learning

I’d add to this list: study design. Knowing how data was generated/collected will always inform the types of questions you can ask, and appropriate method of analysis.

Build a portfolio

Showcase your work

Local meetups

RLadies, Data For Good, PyData

Internship

Don’t forget to reach out to your network and tell people you’re looking to get into a new industry

Preparing your cover letter and CV

More on CV’s

A few tips (based on my personal pet peeves)

Searching for a position

During the interview

Jacqueline Nolis’s book Build a Career in Data Science

“I want to hear about a project they’ve worked on recently. I ask them about how the project started, how they determined it was worth time and effort, their process, and their results. I also ask them about what they learned from the project. I gain a lot from answers to this question: if they can tell a narrative, how the problem related to the bigger picture, and how they tackled the hard work of doing something”.

Asking for feedback after your interview can help highlight the interviewer’s perceptions of your strengths and weaknesses

Employer fears

In general, employers are concerned that new hires without job experience will struggle to see the bigger picture and the pace.

Try to address these fears by addressing them directly in your cover letters, CV’s and during the interview

Conclusion

Pivoting to data science from humanities, social sciences or any other disciplines doesn’t mean you need to leave all of your training behind.

☑️ You have knowledge and/or experience in at least one of these circles

☑️ Look for people in your substantive area who are doing some more quantitative/tech driven work

☑️ Be confident that you know how to learn

✔️ Be ready to start at the bottom

☑️ Be ready be creative in your search and to be a hustler

✔️ Your path is the right path

New directions (my hot takes)

A big thanks!

I can be found on twitter @ea_braithwaite

Come check out our website www.precision-analytics.ca

This presentation & R code can be found online https://github.com/precision-analytics/CSCDS.

Simply download the entire repo via “clone/download” button; the presentation is the html file

Happy to answer any questions